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Abstract 

We introduce the concept of a class of graphs, or more generally, relational structures, being locally 
tree-decomposable. There are numerous examples of locally tree-decomposable classes, among them the 
class of planar graphs and all classes of bounded valence or of bounded tree-width. We also consider a 
slightly more general concept of a class of structures having bounded local tree-width. 

We show that for each property ip of structures that is definable in first-order logic and for each locally 
tree-decomposable class C of graphs, there is a linear time algorithm deciding whether a given structure 
■A G C has property ip. For classes C of bounded local tree-width, we show that for every fc > 1 there is 
an algorithm that solves the same problem in time 0{n^^^^^'^^ ) (where n is the cardinality of the input 
structure). 

1. Introduction 

It is an important task in the theory of algorithms to find feasible instances of otherwise intractable algo- 
rithmic problems. A notion that has turned out to be extremely useful in this context is that of tree-width of 
a graph. 3-COLORABlLlTY, Hamiltonicity, and many other NP-complete properties of graphs can be 
decided in linear time when restricted to graphs whose tree-width is bounded by a fixed constant (see 
for a survey). 

Courcelle ^ proved a meta-theorem, which easily implies numerous results of the abovementioned 
type: Let w > \ and ip be a property of graphs that is definable in monadic second-order logic. Then <^ 
can be decided in linear time on graphs of tree-width at most w. As a matter of fact, this result does not 
only hold for graphs, but for arbitrary relational structures. Although Courcelle's theorem does not give 
practical algorithms, because the hidden constants are too big, it is still useful since it gives a simple way 
to recognize a property as being linear time decidable on graphs of bounded tree-width. Once this has 
been done, a more refined analysis using the combinatorics of the particular property may yield a practical 
algorithm. 

Though maybe the most successful, bounded tree-width is not the only restriction on graphs that makes 
algorithmic tasks easier Other useful restrictions are planarity or bounded valence. For example, consider 
the problem fc-DOMlNATiNG Set for a fixed fc. (Given a graph Q, is there a set D of at most k vertices of 
Q such that every vertex of Q is either equal or adjacent to a vertex in Dl) To solve fc-DOMlNATiNG Set 
in general, we do not know much better than just trying all 0{n^) candidate sets {n always denotes the 
number of vertices of the input graph). However, on planar graphs fc-DOMlNATiNG Set can be solved in 
time O(ll'^n), and on graphs of valence at most /, it can be solved in time 0{{l + l)'"'n) [p^. 

Unfortunately, the analogue of Courcelle's theorem does not hold for planar graphs or classes of 
bounded valence; 3-COLORABlLlTY is a monadic second-order definable property that remains NP-complete 
when restricted to the class of planar graphs of valence at most 4 [|l8|. Instead of monadic second-order, 
we study the complexity of first-order definable properties. 

A preliminary version appeared in Proceedings of the 26th International Colloquium on Automata, Languages, and Program- 
ming, Lecture Notes in Computer Science 1644, pp. 331-340. © Springer- Verlag 1999 
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Seese was the first to give a meta-theorem in the style of Courcelle's theorem for a more general class 
of structures; in he proved that for every I > 1 and for every first-order definable property of structures 
there is a linear time algorithm that decides whether a given structure of valence at most I has this property. 

An observation that has been used for various algorithms on planar graphs (essentially it goes back to 
Baker [|[|) is that there is a bound on the tree-width of a planar graph only depending on its diameter. A 
different way to see this is that a local neighborhood of a vertex in a planar graph has tree-width bounded 
by a number only depending on the radius of this neighborhood. As a matter of fact, given a planar graph 
Q we can compute in linear time a family of subgraphs of bounded tree-width such that a suitably big 
neighborhood of every vertex of Q is completely contained in one of these subgraphs. 

We call classes of graphs admitting such a covering algorithm locally tree-decomposable (a precise 
definition is given in Section ||). Examples of locally tree-decomposable classes of graphs are all classes 
of bounded genus, bounded valence, and bounded tree-width. The concept easily generalizes to arbitrary 
relational structures. 

Eppstein [ [l2| ] considered a closely related, though slightly weaker concept he called the diameter- 
treewidth property (we call this property bounded local tree-width and refer the reader to Section || for 
the definition). Eppstein proved that the subgraph isomorphism problem for a fixed subgraph Ti, asking 
whether a given graph Q contains Ti, is solvable in linear time when restricted to graphs Q contained in a 
class of graphs that is closed under taking minors and has bounded local tree-width. It is not hard to see 
that every class C of graphs that is closed under taking minors and has bounded local tree-width is locally 



tree-decomposable (cf. Lemma 5.6) 



Thus our main result goes much further: 

Theorem 1.1. Let C be a class of relational structures that is locally tree-decomposable and (p a property 
definable in first-order logic. Then there is a linear time algorithm deciding whether a given structure 
^ e C has property ip. 



It may be worth mentioning that in the terminology of [|27[], our result can be rephrased as follows: 
When restricted to a locally tree-decomposable class of structures, the data complexity of first-order logic 
is in linear time. 

Examples of first-order definable properties ai-e fc-DOMlNATlNG-SET and fc-lNDEPENDENT-SET for a 
fixed fc, H-SUBGRAPH-ISOMORPHISM (Given C/, is H C Ql) and 7i-HOMOMORPHlSM (Given Q, is there 
a homomorphism h : H ^ Ql) for a fixed H, {H, /C) -EXTENSION (Given Q, is every Ti (ZQ contained in 
some K. C Ql) for fixed 7i C /C. Let us also give a few examples of a problems defined on other relational 
structures than graphs. For fc > 1, /c-Set-Cover is the problem of deciding whether a given family T of 
sets has a subfamily S of size at most k such that [jS ^[jT. For d>l, {k, d)-ClRCUlT-SATlSFlABlLlTY 
is the problem of deciding whether a given Boolean circuit of depth at most d has a satisfying assignment 
in which at most k input gates are set to 'true'. Both fc-SET-CovER and (fc, C?)-CIRCUIT-SATISFIABILITY 
can be seen as first-order definable problems on certain relational structures. Thus our theorem implies, 
for example, that fc-SET-CoVER can be solved in linear time for set systems where each element is only 
contained in a bounded number of sets, and that (fc, (i)-ClRCUlT-SATlSFlABlLlTY can be solved in linear 
time for circuits whose underlying graph is planar Of course problems like SUBGRAPH-ISOMORPHISM, 
Homomorphism, Extension can be generalized arbitrary relational structures. 

As a last example, let us consider the problem of evaluating a (Boolean) database query formulated in 
the relational calculus against a relational database. Since relational calculus is the same as first-order logic, 
and relational databases are just finite relational structures, our theorem applies and shows, for example, 
that Boolean relational calculus queries can be evaluated in linear time on databases whose underlying 
graph is planar. As a matter of fact, this last example was one of our main motivation for starting this 
research. It seems that when storing geographical data such as road maps, planar structures come up quite 
naturally. 

Thus our theorem gives a unifying framework for various results solving concrete problems on specific 
locally tree-decomposable classes such as the class of planar graphs. In addition, it yields a number of new 
results of this type. 

Using the same techniques, we prove another theorem that apphes to the even more general context of 
classes of structures of bounded local tree-width: 
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Theorem 1.2. Let C be a class of relational structures of bounded local tree-width and ip a first-order 
definable property. Then for every k > 1 there is an algorithm deciding whether a given structure ^ G C 
has property (p in time 0(n^+(^/'^')). 

The complexity of first-order properties of relational structures has been studied under various aspects. 
It is long known that every first-order property of graphs can be decided in polynomial time, actually in 
ACq A question closer to our theorem is whether deciding first-order properties is fixed-parameter 

tractable, that is, whether there is a fixed c such that every first-order property of finite relational structures 
can be decided in time O(n^). This question has been brought up by Yannakakis [^. The theory of 
fixed-parameter tractability gives some evidence that the answer is no, as has been independently proved 
by Downey, Fellows, Taylor Jll]] and Papadimitriou, Yannakakis [53] (deciding first-order properties is 



AW[l]-complete). Theorem L2 shows that deciding first-order properties of structures in a class of bounded 
local tree-width is fixed-parameter tractable. Furthermore, it has been used in [|l6[ to show that for every 
class C of graphs such that there is some graph that is not a minor of any graph in C, deciding first-order 
properties of graphs in C is fixed-parameter tractable. 

The proofs of our results combine three main ingredients: a refinement of Courcelle's Theorem [|]] 
mentioned above, Gaifman's Theorem [ [l7| ] stating that first-order properties are local, and algorithmic 
techniques based on ideas of Baker [||] and Eppstein [Q. To prove Theorem L2, we also use covering 



techniques due to Awerbuch and Peleg |T]]. 



2. Preliminaries 

A vocabulary is a finite set of relation symbols. Associated with every relation symbol i? is a positive inte- 
ger called the arity of R. In the following, E always denotes a binary relation symbol and r a vocabulary. 

A T-structure A consists of a non-empty set A, called the universe of A, and a relation R-^ C A^ for 
each r-ary relation symbol R £ t. If ^ is a r-structure and B C A, then (B)-^ denotes the substructure 
induced by A on B, that is, the r-structure B with universe B and i?^ R-^ n B"" for every r-ary R E t. 

For instance, we consider graphs as {i?}-structures Q = {G,E^), where the binary relation is 
symmetric and anti-reflexive (i.e. graphs are undirected and loop-free). As another example, we can view 
hypergraphs as {E, P}-structures, where E is binary and P unary. A hypergraph with vertices V and 
hyperedges H C Pow(T^) is modeled by the {E, P}-structure {V U H, {{v, H) \ v e H}, V). 

In this paper we only consider finite structures. Let us remark that all the results of this paper remain 
true if we also admit constants in our structures. We restrict our attention to the relational case because 
constants would not give us additional insights. 

The formulas of first-order logic FO are build up in the usual way from an infinite supply of variables 
denoted by x,y,xi, . . . , the equality symbol = and relation symbols of a vocabulary t, the connectives 
A, V, -1, and the quantifiers V, 3 ranging over the universe of the structure. For example, the first-order 
sentence 

ip := Va;iVa::2Va;3 ((i?a;i2;2 A Exix^ A £^2:2X3) — > 3y{Exiy A Ex2y A Ex^y)) 
says that every triangle of a graph is contained in a K4 (a complete graph on four vertices). The formula 

Px A -'3y3z{-'y = z A Exy A Exz) 

defines the set of all vertices a; of a hypergraph that are contained in at most one hyperedge. 

A free variable in a first-order formula is a variable x not in the scope of a quantifier 3x or Vx. A 
sentence is a formula without free variables. The notation (p{xi , . . . ,Xk) indicates that all free variables of 
the formula (p are among xi, . . . , Xfe; it does not necessarily mean that the variables xi, . . . , all appear 
in (p. For a formula (p{xi, . . . , Xk), a structure A, and ai, ... ,ak G A we write A ^ ^piai, ■ ■ • , a,k) to 
say that A satisfies ip if the variables xi, . . . , are interpreted by the vertices oi , . . . ,ak, respectively. 

Example 2.1. In this example we show how to model the fc-SET-CoVER problem mentioned in the intro- 
duction by a first-order definable problem. We can view a family Tofa sets as the hypergraph whith vertex 
set IJ and edge set JF. 
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Let 



:= 3xi.. . Xk^y(^Py {Eyxi V ... V Eyxk)^ . 



Then the hypergraph corresponding to the family satisfies iph if and only if there exists an iS C of 
cardinality \S\ = k such that U = U 

We often denote tuples (ai , . . . , a^) of elements of a set A by a, and we write a G A instead of a G A''. 
Similarly, we denote tuples of variables by x. 

Our underlying model of computation is the standard RAM-model with addition and subtraction as 
arithmetic operations (cf. [jl|, ^^). In our complexity analysis we use the uniform cost measure. Structures 
are represented on a RAM in a straightforward way by listing all elements of the universe and then all 



tuples in the relations. For details we refer the reader to 1 15 1. We define the size of a r-structure A to be 
1^1 1 := \A\ + X]i?,GT r-ary ^ ' 1^"^!' ^^^^ length of a reasonable representation of A (if we suppress 
details that are inessential for us). 



3 . Gaifman's Theorem 

The Gaifman graph of a r-structure A is the graph Qj\^ with vertex set A and an edge between two 

vertices a, 6 G A if there exists an i? G r and a tuple (ai, . . . , a^) G such that a,b {ai, . . . , ak}- 
The distance d-^{a, b) between two elements a, 5 G A of a structure A is the length of the shortest path in 
Qyl connecting a and b. For r > 1 and a G A we define the r-neighborhood of a in ^ to be N:^{a) := 
{fe G A I d^{a, b) < r}. For a subset B C Awe let N;!^{B) := Ubes N;^ib). 

For every r > there is a first-order formula Sr{x, y) such that for all r-structures A and a, 6 G A we 
have A [= 5r{a, b) <;=4' d-^{a, b) < r. For example, if r = {E, T} consists of a binary and a ternary 
relation symbol, we let 

6a{x,y) -.^{x ^ y) 

5i{x^ y) :=(5o(x, y) V Exy V Eyx V 3z(Txyz V Tyxz V Txzy V Tyzx V Tzxy V Tzyx) 
62{x, y) --^Soix, y) V 5i{x, y) V 3z[5i{x, z) A 5i{z, y)) 

In the following, we write d{x, y) < r instead of 6r{x, y) and d{x, y) > r instead of -^6r{x, y). 

If ifiix) is a first-order formula, then (p^''^^^(x) is the formula obtained from 1^9(2;) by relativizing 
all quantifiers to Nr{x), that is, by replacing every subformula of the form 3yi/j{x, y, z) by 3y{d{x, y) < 
r Aijj{x,y, z)) and every subformula of the form Vy^C^;, J/, ^) by Vy((i(x, < r — > ip{x,y,z)). A formula 
■0(2;) of the form (p^'''^^\x), for some 1^9(2;), is called r-local. The basic property of r-local formulas ^{x) 
is that it only depends on the r-neighborhood of x whether they hold at x or not, that is, for all structures 
^anda G Awehave^ ^ h "0(0). 

Theorem 3.1 (Gaifman [^^). Every first-order sentence is equivalent to a Boolean combination of sen- 
tences of the form 

3xi...3xk{ f\ d{xi,Xj)>2rA f\ ip{xi)), 

l<i<j<k l<i<k 

for suitable r,k > 1 and an r-local ipix). 



4. Tree-width 

A tree is an acyclic graph. A tree-decomposition of a r-structure ^ is a pair (T, {Bt)t£T), where T is a 
tree and {Bt)teT a family of subsets of A (called the blocks of the decomposition) such that 

(1) For every a G A, the set {i G T | a G i?t} is non-empty and connected in T (that is, induces a 
subtree). 
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(2) For every R ^ t and all a e there is a t £ T such that a & Bt- 

The width of a tree-decomposition (T, {Bt)teT) is max{|_Bf | | t G T} — 1. The tree-width tw(^) of A is 
the minimal width of a tree-decomposition of A. 

We occasionally use the following simple fact (cf. [Q). 

Lemma 4.1. Lef w > 1 and r a vocabulary. Then there is a constant c such that for every r-structure A 
of tree-width at most w we have \ \A\ \ < c\A\. 

Bodlaender proved that for each w > 1 there is a linear time algorithm that, given a graph Q, 
either computes a tree-decomposition of Q of width at most w, or rejects Q if tw{Q) > w. This result 
is underlying most of the linear time algorithms on graphs of bounded tree-width. Using the well-known 
fact that a structure A has the same tree-width as its Gaifman graph Qj[, Bodlaender's result can easily be 
extended to arbitrary relational structures. 

Recall Courcelle's theorem that we mentioned in the introduction: 

Theorem 4.2 (Courcelle Let w > 1. Then for every sentence tp of monadic second-order logic there 
is a linear time algorithm that decides whether a given structure A of tree-width at most w satisfies (p. 

Monadic second-order logic is an extension of first-order logic that also allows quantification over sets. 
Using known techniques for algorithms on graphs of bounded tree-width, it is not hard to prove the 
following lemma (see [|5]|). We are only going to use the first-order version of the lemma later. 

Lemma 4.3. Let w > 1. Then for every formula ip{x) of monadic second-order logic there is a linear time 
algorithm that, given a graph G of tree-width at most w, computes the set (p{A) := {a G V'~^ \ G \= (p{a)}. 

5. Local Tree-Width 

Definition 5.1. (1) The local tree-width of a structure A is the function Itw^ : N ^ N defined by 

Itw-^(r) := max{tw((iV,-:^(a))) \ a £ A} . 

(2) A class C of structures has bounded local tree-width if there is a function / : N N such that 
Itw-^(r) < fir) for alM G C, r G N. 

Example 5.2. Structures of bounded tree-width. Let ^ be a structure of tree-width at most k. Then 
Itw^(r) < k for all r G N. 

The valence of a structure A is the maximal number of neighbors of a vertex a G A in the Gaifman 
graph i.e. maXa(.A\{b \ {a,b) G E^^}\. 

Example 5.3. Structures of bounded valence. Let ^ be a structure of valence at most I, for an Z > 1. Then 
Itw^(r) < 1(1 - ly-^ for all r G N. 

Example 5.4 (Robertson and Seymour [^3||). Planar Graphs. The class of planar graphs has bounded 
local tree-width. More precisely, for every planar graph G and r > 1 we have Itw'-^(r) < 3r. 

Example 5.5 (Eppstein [p^). Graphs of bounded genus. Let be a surface. Then the class of all graphs 
embeddable in S has bounded local tree-width. More precisely, there is a constant c such that for all graphs 
G embeddable in S and for all r > we have Itw'-^(r) < c • g{S) ■ r. 

Example 5.6. We can view a simplicial complex as a hypergraph whose vertices are the corners of the 
complex. Then it is easy to see that the class of all simplicial complexes homeomorphic to a 2-manifold 
has bounded local tree-width. 
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Recall that a minor of a graph t/ is a graph Ti that is obtained from a subgraph of Q by contracting edges. 
The class of planar graphs, and, more generally, the classes of graphs of bounded genus are examples of 
classes of graphs that are closed under taking minors. Eppstein gave the following nice characterization of 
all classes of graphs of bounded loocal tree-width that are closed under taking minors. An apex graph is a 
graph G that has a vertex v G V'~^ such that G \ {v} is planar. 

Theorem 5.7 (Eppstein g). Let C be a minor-closed class of graphs. Then C has bounded local 
tree-width if, and only if C does not contain all apex graphs. 

This yields further examples of classes of graphs of bounded local tree-width. For example, for every 
n > 1, the class of all graphs that do not contain the graph K^ ^ as a minor has bounded local tree-width. 
{Km,n denotes the complete bipartite graph with parts of size m and n, respectively.) 



Note that a structure has the same local tree-width as its Gaifman graph, so Examples 5.4 and 5.5 
Theorem 



and 



5.7 also give rise to examples of classes of structures of arbitrary vocabularies that have bounded 



local tree-width. 

One of the nice things about bounded local tree-width is that the notion is quite flexible. Think of a 
structure modeling a subway map. The Gaifman graph of this structure will probably be close to planar, but 
there may be some edges crossing. Therefore, it may be the case that planar graph algorithms do not apply, 
although the graph is almost planar On the other hand, the local tree-width of the graph is probably very 
close to that of a planar graph, and we can still use our algorithms for graphs of bounded local tree-width. 



6. Neighborhood and tree covers 

To explore the local tree-likeness of structures of bounded local tree-width we need to cover them by 
structures of small tree-width in a suitable way. The most general approach is to use sparse neighborhood 
covers, as they have been studied, for instance, in p|, pi]]. 

Definition 6.1. Let r, s > 0. An (r, s)-neighborhood cover of a structure ^ is a family TV of subsets of A 
with the following properties: 

(1) For every a ^ A there exists s. N e Af such that Nf{a) C N. 

(2) For every N £ M there exists an a e A such that N C Nf{a). 

We define the size of a family TV of sets to be ||TV|| J^NeJV 1^1- Recall that the size of a r- 
structure ^ is ||y^|| = \A\ + Vp^^ r-ary ''l^'^l- algorithm of the following lemma is an adaptation 
of an algorithm due to Peleg to our situation. We think it is worthwhile to present our version of the 
algorithm in some detail. 

Lemma 6.2 (Peleg Let k > 1. Then there is an algorithm that, given a graph Q and an r > 1, com- 

putes an (r, 2kr)-neighborhood cover M of g of size = 0(|G|i+(i/'=)) in time 0( EwgaA 1 1 I D" 

Proof: The algorithm is described in Figure |l[ It iteratively computes a neighborhood cover TV, maintaining 
a set H of vertices whose r-neighborhood has not yet been covered by a set in TV. In each iteration step 
of the main loop in Lines 3-13, the algorithm picks an arbitrary vertex a G iJ and starts to compute 
increasing neighborhoods of a (in Lines 6-10) until a certain threshold is reached (cf. Line 10). Then it 
adds the computed set N to the cover TV and removes all points whose neighborhood has now been covered 
from H, before it goes to the next iteration of the main loop. This process is repeated until H is empty. 
Now let Qhe a graph, n :— \G\, and r > 1. Let TV be the cover computed by the algorithm. 

Claim L For every a E G there exists a G TV such that Nr{a) C N. 

Proof: An element a is removed from the set H of uncovered elements in Line 12 if it belongs to a set 
L such that TV — Nff{L) has been added to TV. Of course this TV contains N^{a). This proves Claim 1. 

Claim 2. For every TV G TV there exists an a G G such that TV C TV^j,j,(a). 
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Input: Graph Q,r >1 


1. H := 


G 


2. N:= 





3. whUe iJ ^ do 


4. 


choose arbitrary a G H 


5. 


N := {a} 


6. 


do 


7. 


M ■.= N 


8. 


L:=N^{M)nH 


9. 


N := N^{L) 


10. 


while \N\ > ni/'=|M| od 


11. 




12. 


H :=H\L 


13. od 




Output: TV 





Figure 1. 



Proof: We consider the iteration of the main loop that leads to the definition of N. Let a be the 
element chosen in Line 4, and let A^o '■= {d}- Let Z > 1 be the number of times the loop in Lines 6-10 is 
repeated. For 1 < i < Z, let Ni be the value of N after the ith iteration. Then for l<i<Z — Iwe have 
|iVi| > ni/'=|iVi_i|, and therefore \Ni\ > n^l^. Thus I < k. 

Furthermore, it is easy to see that for 1 < i < Z we have Ni C N2i^{a). This imphes Claim 2. 

Claims 1 and 2 show that Af is indeed an (r, 2A;r)-neighborhood cover of Q. The following Claim 3 
shows that the cover is not too large. 

Claim 3. \\Af\\ <n^+('^/''\ 

Proof: For N e M, and let M be the corresponding set that is computed in the last iteration of the 
loop in Lines 6-10 that let to N (i.e. M is the value of after the second but last iteration of the loop). 

We first show that for distinct Ni,N2 £ A/" we have Mi D M2 = 0. To see this, suppose that A''i 
is computed first. Let Hi be the value of H after the iteration of the main loop in which A'^i has been 
computed. Note that for every a G Mi and b e Hi we have dP{a,b) > r. Moreover, M2 ^ ^ 
N^{Hi). Thus Ml n M2 = 0. 

Noting that by the condition of Line 10, for all € A/" we have | A'^l < n^^^\M\, we obtain 

\\M\\= \N\ < n^l^ ^ |M| < n^l^ ■ n. 

The last inequality holds because the M are disjoint subsets of G. This proves Claim 3. 

It remains to estimate the running time of the algorithm. We claim that each iteration of the main loop 
requires time 0{{N)^), for the N added to JV in this iteration. To see this, note that essentially we have to 
do a breadth-first search on starting in a. To compute L in Line 8, we may have to explore some edges 
not contained in (L). However, all these edges belong to {N^{L)) = (N). 

It may seem that to check the condition of Line 10 we need multiplication, which is not available as 
basic operation of a standard RAM. However, before we start the main computation we can produce tables 
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that store the values to and m ■ n for 1 < Z < fc, 1 < m < n in Hnear time on a standard RAM. (We use 
the fact that 

{m + iy= J2 m^'=i^- 
(£1,... ,e,)e{o,i}' 

to inductively compute the tables. Remember that we treat /c as a constant.) Then we can use these tables 
to check the condition of Line 10 in constant time. □ 



Corollary 6.3. Let k,r > 1, t a vocabulary, and C a class of r-stmctures of bounded local tree-width. 
Then there is an algorithm that, given a structure ^ G C, computes an (r, 2kr)-neighborhood cover M of 
Aofsize \\N\\ = 0(|A|i+(i/'=)) mnweO(|A|i+(i/fc)). 



Proof: Since neighborhoods of radius 2kr in structures in C have bounded tree-width, by Lemma 4.1 there 
is a constant c such that for every structure ^ G C, every (r, 2fcr)-neighborhood cover A/" of A, and every 
G A/" we have 

||(A)-^||<c|7V|. (1) 

This implies \ \A\\ < c\\M\\. 

Our algorithm first computes the Gaifman graph C/^ of the input structure A, which is possible in time 
O(ll^ll). Then it computes an an (r, 2 fcr) -neighborhood cover A/" of A of size ||7V|| = 0(|A|i+(i/'=) ). By 
Lemma U and (I), this is possible in time 0(1 |7V| I) = 0{\A\^+^^/*'^). □ 

The following consequence of the proof of the previous corollary is worth being noted: 

Corollary 6.4. Let t be a vocabulary and C be a class of r-structures of bounded local tree-width. Then 
for every k > 1 there is a constant c such that for all structures A d C we have \ \A\ \ < c|^|^"'"*-^/'^^. 

As a matter of fact, a neighborhood cover is more than we need. Often, the following weaker notion of 
a tree cover leads to better results. 



Definition 6.5. Let r,w > 0. An (r, w)-tree cover of a structure ^ is a family T of subsets of A with the 
following properties: 

(1) For every a e A there exists a T G T such that Nj-^{a) C T. 

(2) For every T e T we have tw((r>-^) < w. 

Note that an (r, s) -neighborhood cover of a structure A is an (r, ltw-^(s))-tree cover of A. The follow- 
ing lemma is implicit in [ II4 ] : 

Lemma 6.6 (Eppstein [p^). Let r > and C be a class of graphs that is closed under taking minors and 
has bounded local tree-width. Let f : N ^ N be a function bounding the local tree-width of the graphs in 
C. 

Then there is an algorithm! that, given a graph 5 G C, computes an (r, f{2r + l))-tree cover TofQ of 
size \\r\\ = 0{\G\) in time 0{\G\). 



Proof: Let Q ^ C and choose an arbitrary vertex gq G G. For < z < j, let G[i,j] := {a E G \ i < 
d'^{ao,a) <j}. 

We claim that tw((G'[«, j])) < f{j — i + 1). This is immediate if i = or i = 1, because then 
C N^{ao). If « > 1, we simply contract the connected subgraph {G[0,i — 1])^ to a single vertex 
60. We obtain a minor Q' of G, which is also an element of C by our assumption that C is closed under 
taking minors. G' still contains the set G[i, j] as it is, but this set is contained in N^_^^j^{bo). This proves 
the claim. 
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The claim implies that for all r > 0, the family T := {G[i, i + 2r] | i > 0} is an (r, /(2r + l))-tree 
cover of Q of size at most (2r + 1)|G|. On input Q, we can choose an arbitrary gq and then compute this 
tree cover in linear time by breadth-first search. □ 

The existence of a tree-cover of size linear in the size of the structure and a linear time algorithm 
computing such a cover is exactly what we need in our algorithms of the next section. This justifies the 
following definition: 

Definition 6.7. A class C of graphs is locally tree-decomposable if there is a function ^ : N — > N and an 
algorithm that, given a structure ^ e C and an r G N, computes an (r, 5(r))-tree cover of A of size 0(| A|) 
in time 0(1^1)^ 

Examples 6.8. All examples of classes of structures of bounded local tree-width that we gave in Section ^ 
are actually locally tree-decomposable. 

For Example [5.3[ classes of structures of bounded valence, this is trivial: If A is a structure of valence 
I and r > 0, then the family {N:i:!^{a) \ a G A} is a n (r , l{l — lY~^)-tree cover of A 

For all other examples, it follows from Lemma p. 4 

The following proposition is an immediate consequence of the definition of locally tree-decomposable 
classes of structures: 

Proposition 6.9. Let t be a vocabulary and C be a locally tree-decomposable class of T-structures. Then 
there is a constant c such that for all structures A £ C we have \ \A\ \ < c\A\. 



We close this section with an example showing that the analogue of Proposition 5.9 for classes of 



bounded local tree-width is wrong. Remember Corollary 5.4, though. 



Example 6.10. We construct a class C of graphs of bounded local tree-width such that for every constant 
c there is a graph Q £ C with \ \Q\ \ > c\G\. 

We use the following theorem due to Erdos [Q: For all g,k > 1 there exists a graph of girth greater 
than g and chromatic number greater than k. Remember that the girth g{Q) of a graph Q is the length of 
the shortest cycle in Q and the chromatic number x(0) of Q is the least number of colors needed to color 
the vertices of Q in such a way that no two adjacent vertices have the same color It is easy to see that every 
graph Q with x(0) ^ has a connected subgraph TL with average degree 

yW - 

(cf. ||],p. 98). 

The diameter of a connected graph Q is the number dmm{Q) :— max {d^{a,b) I a,b£G}. 

We inductively construct a family {Qi)i>i of graphs as follows: Qi is the graph consisting of two 
vertices and an edge between them. Suppose now that Qi is already defined. Let Q[j^i be a graph with 
g{Qi+i) > 2diam(^i) + 1 and xiQ'i+i) > 2i + 3. Let Qi+i be a connected subgraph of Q'^^i with 

4tc > 2i + 2. 

Clearly, g{g,+i) > g{g[^^) > 2diam(a0 + L 

Observe that for every r > 1 and every graph Cy, if 2r + 1 < g{Q) then Itw^(r) < 1. Moreover, if Q is 
connected then Itw^(r) — tw{Q) for all r > diam(C?). For every i > 1 and diam{Qi) < r < diam(t/i_|_i), 
we let /(r) :— max{tw(t7i), Itw^'+i (r)}. We claim that Itw^' (r) < /(r) for all i,r > 1. This is obvious 
fori = 1. Fori > 2, we have to distinguish between three cases: If r < diam(C/i_i) < ■^{giQi) — 1), then 
Itw^'(r) < 1 < f{r). If diam(C/i_i) <r< diam{Qi), thenltw^'(r) < f{r) immediately by the definition 
off. If r > diam(aj), thenltw^'(r) = tw(g,) > /(r). 

Thus the class C:={^i|i>l} has bounded local tree- width. On the other hand, for every i > 2 we 
have 115,11 > \E^'\ > i\G^\. 



'The hidden constants in the 0(-)-notation may depend on r. 
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7. The main algorithm 

Throughout this section, we fix a vocabulary t. We shall first prove two lemmas. 

Lemma 7.1. Let C be a class of r-structures of bounded local tree-width and r,w > 1. Then there is an 
algorithm that solves the following problem in time 0{\\T\\): 



Input: Structure ^ G C, (r, ?z;)-tree cover T of A. 
Problem: Compute Kt ■= {a€ A \ Nf{a) C T} for all T e T. 



Proof: Observe that ||^|| = O ( 1 1 T 1 1 ), because by Lemma p] for all T e T we have ||(T}-^|| = 0{\T\). 

Without loss of generality we can assume that ^ is a graph; if not we first compute its Gaifman graph. 
This is possible in time 0(| 1^1 1). Furthermore, we can assume that the universe A of ^ is the set {1, ... ,n} 
(see the appendix of JT^ for details). 

Let T E T, we show how to compute Kt in time 0(|T|). We suppose that T is given as a list 
ai, . . . , a„i of its elements. Our algorithm is displayed in Figure |[ Kt is computed iteratively, during the 
computation the current state of the set is stored in an array K of length n. Note that we do not initialize 
the array to in the beginning (we do not have the time to do that). Instead, we maintain a second "control 
array" F of length m. The jth entry of F is aj, for j = 1 to m. F is initialized to these values in Line 1. 
Then at every stage in the computation, the set of all elements represented by the array K is 

S{K) :={aeA \ K[a] e {1, . . . , to} and r[K[a]] = a}. 

After Line 2 is executed, we have S{K) — T. 

Now the main loop in Lines 3-13 iteratively removes those elements from S{K) whose neighbors are 
not all contained in S{K). Thus after the ith iteration we have 

SiK) = {aeT I Nfia) ^ T}. 

So once we enter Line 15, we have S{K) ~ Kt- Lines 15-17 retrieve this set from the array K. 

Let us analyze the running time of the algorithm. Lines 1 and 2 require time 0{m). To test the condition 
of Lines 7-8 requires constant time for each b. To test the condition of Lines 6-8, we have to step through 
the list of vertices adjacent to aj until either we find a b that does not satisfy the condition or we have 
checked all neighbors. This requires a constant amount of work for every edge with one endpoint Uj and the 
other endpoint in S{K) and an additional constant amount of work in case we find a neighbor not in S{K). 
Thus the execution of the loop in Unes 5-11 requires time 0{m + \E-^ n T^l) < 0(||(T)'^||) — 0{m). 
The loop in Lines 12-14 also requires time 0{m). Thus every iteration of the main loop requires time 
0{m). Since we treat the number r of iterations as constant, the overall time required by Lines 3-15 is 
0{m). Since Kt C T, Lines 16-19 also require time 0{m). □ 



Lemma 7.2. Let C be a class of structures of bounded local tree-width and r,m > 1. Then the following 
problem can be solved in time 0{\A\): 



Input: Structure A € C, set P C A. 
Problem: Decide if there exist ai, . . . , am G P such that d-^{ai, Uj) > r. 



Proof: Let / : N ^ N be a function bounding the local tree-width of the structures in C. 

Let ^ e C and P C A. Our algorithm is displayed in Figure ^ It proceeds in two phases. 

In the first phase (Lines 1-12) it iteratively computes elements ai, . . . ,ai E P, for some i < m, such 
that d'^{ai, aj) > r for 1 < i < j < I and either / = m or for all & G P there is an i < / such that 
b S Nf'lai). If I — m, the algorithm accepts. If / = 0, i.e. P = 0, then it rejects. Otherwise, it goes into 
the second phase (Lines 13-18). 
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Input: A, T 


— \ai, . . . , Gm} ^ /i 




1. 


for j : 


= 1 to m do T[j] :— aj od 




2. 


for j = 


= 1 to 771 do K[aj] :— j od 




3. 


for i = 


= 1 to r do 




4. 




temp 




5. 




for J = 1 to TO do 




6. 




if Qj has a neighbor b such that 




7 

8. 




( K[b](^{l,... ,m} 
or [K[b] e {1, . . . ,?Ti}andr[Ar[6]] b 


j then 


9. 




temp := temp U {aj} 




10. 




ft 




11. 




od 




12. 




for all a G temp do 




13. 




A' [a] 




14. 




od 




15. 


od 






16. 


Kt ■■= 


= 




17. 


for j = 


= 1 to TO do 




18. 




if K[aj] — j then Kt Kt U {a_, } fi 




19. 


od 






Output: Kt 







Figure 2. 



When the algorithm enters Line 13, we have P C N^{{ai, . . . , a;}). Let 7i := {N^{{ai, . . . , a;}). 
Then for all b,b' e P we have ^-^(6, 6') < r d^ib, b') < r, because P C A^;:^({ai, . . . , a;}) 

and thus every path of length at most r between two elements of P must be contained in H. Thus there 
exist bi, . . . ,bm £ P such that d-^{bi, bj) > r if, and only if, there exist 6i, . . . ,bm & P such that 
(f^{bi, bj) > r, i.e. if the condition in Line 14 is satisfied. Thus the algorithm is correct 

To estimate the running time, we note that ||(A'!;:^(ai))-^|| = 0{\N:;^^{ai)\), because C is a class of 
bounded local tree-width. Since we treat r and to as constants. Lines 1-13 require time 0(|A|). It is easy 
to see that tw(?i) < ltw'^(2?r) < f{2lr). Thus the condition in Line 14 can also be checked in time 
0{\A\) by Courcelle's TheoremO □ 



We are now ready to prove our main results. Theorems 1.1 and 1.2. Recall the statements: 



Let C be a class of structures of bounded local tree-width and Lp a sentence of first-order logic. 

(1) For every k > 1 there is an algorithm that decides whether a given structure A ^ C 
satisfies ip in time 0{\A\^+^'^/^'i). 

(2) IfCis locally tree-decomposable, then there is an algorithm that solves the problem in 
time 0{\A\). 
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Input: AgC,PCA 

1. Q ~P 

2. l:=0 

3. while Q 7^ and / < m do 

4. I -.= 1 + 1 

5. choose ai € Q arbitrarily 

6. Q:=Q\Nf{ai) 

7. od 

8. Ml = m then 

9. ACCEPT 

10. else 

11. if ; = then REJECT fi 

12. fl 

13. compute H iV^({ai, . . . , a;}) 

M it{{H)^,P) h3a;i...3a:™(A™i^a:, AAi<,<,<„c?(2;»,a;j) >r) then 

15. ACCEPT 

16. else 

17. REJECT 

18. fl 



Figure 3. 



Proof: We describe the algorithm for (1) and then explain how it has to be modified to obtain (2). 



By Gaifman's Theorem 3.1 , without loss of generality we can assume that ip is of the form 



3xi...3xm{ /\ d{xi,Xj)>2rA f\ tp{xi)), 

l<i<j<ni l<i<m 

for suitably chosen r, m > 1 and an r-local i/;. 

Let k > 1 and / : N ^ N be a function bounding the local tree-width of the structures in C. Let r 
be the vocabulary of the sentence (p; without loss of generality we can assume that all structures in C are 
T-structures. 

Figure Q shows our algorithm. 

To see that the algorithm is correct, note that since ip(x) is r-local we have P={a(^A\A[^ 

So we shall prove that the algorithm can be implemented as an 0(?i^+(^/'^))-algorithm, where n := \ A\ 
is the cardinality of the input structure. 

Line 1 requires time 0(n^+(^/'^)) by Corollary 5.3, Lines 2-4 require time OdlA/"!!) by Lemma 7.1 



For every N € Af, Line 6 requires time 0(|iV|) by Lemma Thus the loop in Lines 5-7 also requires 
time OdlA/"!!). Clearly, Line 8 can be performed in time OdlA/"!!), and the condition in Line 9 can be 
checked in time 0(|v4|) by Lemma 7.2, Thus the overall running time is OdlA/"!!) = 0(n^+'^^/'')). 



It remains to prove (2), but this is very easy now. Instead of a neighborhood cover, in Line 1 of the 
algorithm we compute tree cover of linear size. This can be done in linear time by the definition of a locally 
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Input: Structure A GC 

1. compute an (r, 2fcr)-neighborhood cover A/" of A of size 0(A^+^^/*''') 

2. for all iV e A/" do 

3. compute Kn := {a e N \ Nf{a) C N} 

4. od 

5. for all iV e TV do 

6. compute Pj^ -.^ [a e Kn \ {N)-^ ^ V'(a)}- 

7. od 

& compute P := U 

9. if there are ai, . . . , a,,,. G P such that (i(ai, Aj) > 2r for 1 < i < j < k then 

10. ACCEPT 

11. else 

12. REJECT 

13. fi 



Figure 4. 



tree-decomposable class of graphs. Since the running time of the rest of the algorithm is linear in the size 
of the cover, we obtain a linear time algorithm. □ 



8. Concluding remarlss 



Uniformity. A close look at our proofs shows that actually for each locally tree-decomposable class C 
of structures there is a recursive function / : N ^ N and an algorithm that decides, given a first-order 
sentence ip and a structure A G C, whether A \= f in time 0(/(||(p||)|A|),_where \ \lp\ \ denotes the length 



of the sentence ip. We can obtain an anlogous uniform version of Theorem [L2. 

We stated and proved non-uniform versions of the theorems for the sake of a clearer presentation. 

Dependence on the formula size. Our algorithm heavily depends on the size of the formula (f, roughly the 
hidden multiplicative constant is /c-fold exponential in the length of ip, where k is the number of quantifier- 
alternations in Lp. 

Practical Considerations. The large hidden constants seem to make our algorithms useless for practical 
purposes. Nevertheless, let us briefly discuss a few more practical aspects. 

The main factor contributing to the large constants is the complexity of the formulas, in particular the 
number of quantifier alternations. However, if we think of a database application, we will usually only 
have to handle very simple formulas. As matter of fact, most database queries are so called conjunctive 
queries; they can be defined by first-order formulas of the form 3xi . . . ^^^(ai A . . .A am)- Such formulas 
do not have any quantifier alternation. Moreover, when handling conjunctive queries we can avoid the 
second very costly part hidden in our algorithms, namely the transformation of a formula according to 
Gaifman's theorem. For all we know, such a transformation may blow up the formula size by a non- 
elementary factor. For conjunctive queries, we can avoid Gaifman's theorem and instead use algorithmic 
techniques similar to those in [pjjl. With these techniques, the dependence on the formula size can be 
reduced to a singly exponential factor, which seems acceptable because usually in practice we have to 
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evaluate small formulas (queries) in large structures (databases). The third costly factor is to compute tree- 
decompositions. Bodlaender's linear time algorithm is only of theoretical interest due to very large hidden 
constants. More promising seems to be an algorithm due to Reed [ p2| ] (improving an earlier algorithm 
due to Robertson and Seymour [|4|). For an input graph of tree-width w and size n, this algorithm only 
computes a tree-decomposition of width at most 4w. Its running time is not linear in n (as Bodlaenders), but 
0{n log(n)). However, the algorithm is simple and the hidden constants, though exponentially depending 
on w, do not seem too large for small values of w. Let us also remark that there are much more efficient 
algorithms for computing small width tree-decompositions of planar graphs of small radius JI^. 

Nevertheless, as they stand our results are mostly theoretical. Similarly to Courcelle's Theorem [^, 
their main benefit is to provide a quick and simple way to recognize a property as being linear time com- 
putable on certain classes of graphs. Analyzing the combinatorics of the specific property then, one may 
also find a practical algorithm. 



Further Research. Although Example |6.10 shows that for classes C of bounded local tree-width we 



cannot expect an algorithm deciding a first-order property of structures ^ e C in time 0(|yl|), it does not 
rule out an 0(||^||)-algorithm. To obtain such an algorithm, it would be sufficient to find, for every r > 1, 
a w > 1 and an 0(| |^| |)-algorithm that computes an (r, w)-tiee cover of a structure A (z C. 

As we mentioned, one of the main factors contributing to the heavy dependence of the running time 
of our algorithms on the size of the formula is the transformation into a "local formula" according to 
Gaifman's theorem. Though this transformation is clearly effective, as far as we know its complexity 
has not been studied. We do expect this complexity to be non-elementary, but this does not rule out the 
existence of more efficient algorithms for particular classes of formulas (such as existential formulas) or 
the existence of good heuristics. 

In general, we consider it as one of the main challenges for further research to reduce the dependence 
on the formula size (not only in our results, but also in Courcelle's theorem). For example, is there an 
algorithm that decides, given a first-order sentence cp and a planar graph Q (or a tree, or just a word), 
whether Q \= ipin time 0(2l'^ln'^) for some fixed-constant c? 
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